notebook.community

Chapter 2 Multi-arm Bandits

Evaluative feedback is the basis of methods for function optimization, including evolutionary methods.

2.1 A k-armed Bandit Problem

Content source: wangzhe3224/notebook

Similar notebooks:

notebook.community | gallery | about